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ABSTRACT 

The inverse power law distributions are used as the model for fractal probability 
distributions that have fractional exponents [A] and such that the transformation 
X(l-A) is uniformly distributed. The paper examines aspects of point estimation 
and tests of hypotheses about statistical fractals. It is shown that the maximum 
likelihood estimator of the fractional dimension A is uniformly minimum variance 
unbiased estimator (UMVUE) using the Lehmann-Scheffe's theorem and also 
that the likelihood ratio test for HO: A= AO is uniformly most powerful (UMPT) 
by the Neymann-Pearson Lemma. The paper likewise explains that the test for 
equality of two medians of two fractal distributions is equivalent to a test for 
the equality of the fractal dimensions. In fact, the result is generalized to the test 
for the equality of two ath quantiles of two fractal distributions. The test for the 
equality of two fractal distributions is compared with the Mann-Whitney U test 
and with the Student's t-test for independent samples in terms of robustness and 
asymptotic Pitmann efficiency. 
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I. INTRODUCTION 

In an earlier paper, Padua, Barabat, Borres 
& Salazar et al. (2013] considered statistical 
descriptive measures for fractal observations. 
The authors suggested the use of the a th quantile, 
x~_a, as a replacement of the (non-existent) mean 
of fractal observations, and the ruggedness index, 
I_R, for the corresponding (non-existent) notion 
of variance. Evidence of the inappropriateness 
of normal distribution theory in dealing with 
the inherent ruggedness and irregularities of 
real-world data are mounting (e.g. Mandelbrot, 
1967; Devaney, 1988, Selvam, 2008) so that a 
corresponding fractal statistical analysis theory 
needs to be developed. We pursue this objective 
in this paper by establishing the framework for 


statistical inference of fractal observations. 

Classical statistical inference makes use of the 
sample mean x as an estimate of the true mean 
(^T), whenever the latter exists, for mathematical 
simplicity (Selvam, 2008). The sample mean,^, 
is an unbiased estimator of M whose variance 
decreases with increasing sample size (n). 
Moreover, for large n, x n behaves like a normal 

random variable with mean, fi, and variance,—. 

_n 

Indeed, this mathematical tractability of x is 
enshrined in the famous Central Limit Theorem 
as postulated by De Moivre which is often cited 
as a justification for using the simple average in 
most statistical analysis. When the mean (g) does 
not exist, however, the entire classical statistical 
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inference based on normal distribution collapses. 

However, most situations or phenomena 
in the real world involve a mean that does not 
exist such as earthquakes, stock market crashes, 
student test scores, and the like (Selvam, 2008]. 
It is for this specific reason that statistical 
fractals are developed. This branch of statistics 
is in its infant stage despite a long history of 
recognition that most data/observations do not 
subscribe to a normal probability distribution. 
Indeed, mathematical statisticians attempted to 
fit an otherwise highly rugged, self-similar and 
irregular set of observations to a comfortable 
and safe normal theory. For instance, in the 
1990's, large or unusually small observations 

[I. Fractal Statistical Inference 


were treated as "outliers" or deviants which can 
be treated by Robust Statistics. The notion that 
these outliers were not deviants but really part 
of a usual phenomenon came much later. In fact, 
in an attempt to put Robust Statistics as part of 
classical statistics, some proponents suggested 
"throwing away" the outliers (see Huber [1981] 
for an excellent account of Robust Statistics]. 

Stochastic self-similarity at various scales 
is a notion that most probably account for most 
of natural processes. Such a view provides new 
lenses from which to study cold data in order to 
extract information lost by the smoothing process 
of classical statistical inference. 


Let J be the class of power law distributions parameterized by 6, 6 > x for all x, and A, thi 
fractal dimension: 


-A 


^1] ...J = fx'. [0, oo] -> R + \fx(x) = - (^f) . Let SI = {(0, A) Q R+] be the parametric domain, SI S 

R 2 . Let x lf x 2 ,... ,x n be a random sample from f\(x-,6) E F. On the basis of the informatioi 
:ontained in x lt x 2t ... ,x n , we wish to develop tests of hypotheses about A and 6. 


II.1 Point Estimation 


The likelihood function L(x; 0) can be derived as: 


2)... l = nr=i/;iOi;0) = 


A-l 

~e 


in 


n?-i (?)'*• 


The corresponding log - likelihood function is: 


[3) ...Log L = nlog^j- - ^ =1 log (jj . 

The maximum likelihood estimators [MLE] of A and 6 respectively are: 

[4) ... X = 1 + n [xr=i log (^)] 


6 = min{x 1 ,x 2 ,... ,x n }, 
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Properties of the Estimators A and 6 


We show that the MLE for A is unbiased for A. 


Proof: Take n = 1 and 1 = 1 + 


= 1 +e ( 4 i>) 


l °9 G,j 


-i 


and so: 


= 1 + 


U-l) f OO (|) 


-A 


o J o 


f 




dx 


Letu = log (^^j, x = 9 e u and dx = 9 e u du 

S /V X ^ OO 

^(l) = 1 + (A — 1) f Q —-— du 


Now: 


J°°--du = f°°u ^ du. 

* , 0 u 


We recognize the last integral as a Gamma function with a = 0, /3 = Thus: 

C^- du = C u ~ le ~ (A ~ 1)u du = = i. 

J 0 u J 0 l 

Alternatively, we can use Complex Integration to show this last statement. 
Hence: 

E(X) = 1 + (A - 1) = A. 


Now, for a sample of size n, E(l) = 
becomes: E(X) = 1 + n ■ ^ = A. 


1 + n 


E{zUiog (§)")] 


Since A is the MLE of A, it possesses other 
desirable properties, namely that it is consistent 
and also asymptotically normal with mean A and 

i 

variance equal to ; , where j(A) is the Fisher's 
information number. Likewise, since it is a function 


of sufficient statistics, we are only required to 
show completeness of the distribution of X to be 
able to use Lehmann Scheffe's Theorem to prove 
that it is UMVUE. 
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Theorem 1: The random variable X from a fractal distribution comes from a family of complete 
distributions. 


, 1 - 


Proof: 

Let g(x) 

-A 


e 


be any function of x such that E(g(x)) = 0. Then: E(g(x)) = 
f e g(x) G) dx = 0. This reduces to: J^^^-dx = 0. For the improper integral to exist, 


g(x)a O where 8 > 1 — A and so: J 0 °° dx = 0. Hence: 


l-A-S x A+5 “ 1 


oo 

e 


= o. 


The last equation implies that: 

1 

qA+S -1 “ ® 

Further implying that 8 -* oo since B is 

finite. It follows that g(.x = ^ 0 and thus: 
g(x) = 0 a.e. so X comes from a complete family 
of distributions. 

By the Lehmann-Scheffe's theorem, we 
conclude that A must be a uniformly minimum 
variance unbiased estimator of A ( U.M. V. U. E.): 

Theorem 2: Let A be the MLE of A. Then, A 
is function of complete sufficient statistics; A" is 
unbiased for A and so A is the uniformly minimum 
variance unbiased estimator (UMVUE) of A. 

Proof: 

Since A is the MLE of A, it is always a function 
of sufficient statistics. Theorem 1 shows that it 
is complete while we have shown that E(A )=A. It 
follows from the Lehmann Scheffe's theorem that 
A is UMVUE of A. 

The likelihood ratio test (LRT) statistic: 


Before performing any of the statistical test 
of hypotheses suggested in this paper on a set 
of observations, it is expedient to ascertain that 
the observations indeed come from a fractal 
probability distribution. To do this, we use 
Theorem 2 of the paper of Padua et al. [2013] 
which required: (a.) that the maximum likelihood 
estimator of the fractal dimension be fractional 
(non-integral), and (b.) that the distribution of 
X(l-A) be uniform. We recommend the analytical 
representation of a Q-Q plot to test the latter (see 
Ryan-Joiner test). 

The simplest situation in statistical inference 
is when we are interested in testing: Ho: 
A=A_0 versus Ha: A^A_0. For instance, we may 
be interested in determining whether or not 
x_l,x_2,...,x_n are non-fractal observations viz: 
Ho: A=0; Ho: A=l; etc. If they are not, then we can 
use classical statistical inference in pursuing the 
study. 


(5) ... ¥ = 


supL(x; A) 
A 


L(x ; Aq) 


rejects Ho if ^ > 1 or equivalently, where log'V > 0. However, supL(x;A ) = L(x;l) where 

A 

A 

maximizes L(x; X) and A is the MLE. It follows that the likelihood ratio test (LRT) becomes: 




> 1 


( 2 - 1 ) f Xl \ A 0-* fx n \ A °~ A 

l -o7I)'W •"IvJ 
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or equivalently: 


(7) logL = log 


(A-l) 

(Aq —l) 


+ 


(4 - A) S" =1 log (J) > 0 


Let Z = 4-^, then the left - hand side of (7) becomes: 

z = p-^g-,°»e) <z 

1 J log(A-l)-log(A 0 -l) 

with a decision criterion that says, "reject Ho iff z < z a , z a is the (1 — a) th quantile of 
the distribution of z under Ho." We note that Z is close to zero when Ho is true. 

In order to derive the null distribution of z, we can use a simpler form of (8) : 
(9)... z = (a- a 0 ) 2" =1 log (j) 

We find the distribution ofy = (A — A 0 )log first: 

Theorem 3: The null distribution of the random variable 

Uo-l) (x\- X * 


y = (A - A 0 )log if /*(*) = 


e \e. 


is: 


g(y) = ^o) e (1_Ao)(y 1} - l <y < 


an exponential distribution. 


Proof: 

From: 

Rewrite to: 

y = (1 - A 0 ) £ log (j) + n 


y = (A - A 0 ) ^ log 


y — n = r'Zlog Oj) where r = (1 — /l 0 ) 


Let n=l: 


y — 1 = r log (|) 
<■>» (?) = T 1 


y-i 

x = 6 e r 
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dx q y_ _^ 

So that the Jacobian of the transformation is: — = - e r 

dy r 

It follows that: 


-X 


Zz 1 

;y — 1 \ I 6e r 

giy) = I— —II —-— i -~e r 


e 


6 


e y -1 

r 


giy) = 


a-D 


a-D 


e r 


(y-i) 


g (y) = ^H e -^h<y-^ 

(1-A 0 ) 


The moment - generating function of y is: 


(10)... m y (t) = 

and so the moment - generating function of z = ~Yi =i ^ where y 1; ..., y n are iid from (10] is: 


(11)... m z (t) = (—^) 


n 


which is the moment-generating function of a Gamma random variable with a = n and /? = -. 


It follows that the null distribution of z is: 


(12] h(z) = ^=rr — z n 1 e r z ,z > 0. 

n r(n) 

The (1 — a) th quantile of z can be obtained from (12]. 

Relationship Between - Fractal Dimension (A) and the Median (ju). 

In this section, we show that if A 1 > A 2 then > jl 2> that is observations with higher fractal 
dimension also have higher medians. 

Let x d then: 

m) = \ or 1 - (f) 1 " = 

(13) .../i = 2 a ~ 1 6 

It follows from (13) that if A 1 > A 2 , then 2 Xl ~ 1 > 2 Xl ~ 1 so fi 1 > jl 2 is equivalent to a test for the 
equality of the median of two fractal distributions. 
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Theorem 4: The likelihood ratio test (LRT) 
for the equality of two fractal dimensions is 
equivalent to a test for the equality of the median 
of two fractal distributions. 

Alternative Non-Parametric Test 

When the interest is on comparing two 
[2] probability distributions (rather than their 
roughness], we can use a non-parametric method 
called Mann-Whitney U test. The Mann-Whitney 
U test is a non-parametric method for testing 
hypothesis that one of two samples of independent 
observations tends to have larger values than the 
other. 

Important assumptions necessary for a very 
general formulation of the Mann-Whitney U test 
is discussed at length in the papers of Nachar 
(2008], McKnight & Najab (2010], and Fay & 
Proschan (2010]. 

From these assumptions, a convenient 
formula for this test can be derived as follows: 

The ranks will be added for the observations 
from the sample 1. The sum of ranks in sample 2 
will then be determinate, because the sum of all 
the ranks is equal to N(N + l]/2 where N is the 
total number of observations. 

U is now given by: 

where nl is the sample size for sample 1, and R1 is 
the sum of the ranks in sample 1. 

Note that it does not matter which of the two 
samples is considered sample 1. An equally valid 
formula for U is 

The smaller value of U1 and U2 is the one 
used when consulting significance tables. The 
sum of the two values is given by knowing that R1 
+ R2 = N(N + l]/2 and N = nl + n2 , and doing 
some algebra, we find that the sum is comparable 
to Student's t-test and to the Likelihood Ratio Test 
for Fractal Dimensions. 

The U test is useful in the same situations as 
the independent samples Student's t-test, and the 
question arises of which should be preferred. 

Ordinal data. Choosing the U remains the 
most logical action when the data are in the form 
of ordinal measurement and not interval scaled, 


for the purpose that the spacing between adjacent 
values will not be assumed as constant (Motulsky, 
2007]. 

Robustness. Since Mann-Whitney compares 
the sums of ranks, it is less likely to falsely indicate 
significance than the t-test because of the presence 
of outliers. It means that Mann-Whitney is better 
than t-test in terms of robustness as a statics 
test. This holds as well for the proposed test for 
the equality of fractal dimensions i.e. the test for 
equality of fractal dimensions based on the MLE 
of? is less affected by outliers (Motulsky, 2007]. 

Efficiency. MWW has an (asymptotic] 
efficiency of or about 0.95 when compared to the 
t test if the data distribution adheres to normality. 
For distributions that do not follow normality and 
for those with large sized, the MWW can better 
than t in terms of efficiency (Mann, & Whitney, 
1947] 

When non-normality holds and the 
distributions are actually fractals, the MWW has 
an asymptotic efficiency of ttt~t relative to the 
test based on fractal dimensions (FDT). Note that 
MWW can be more efficient than the test based on 
fractal dimensions for low dimensional fractals but 
can also be quite inefficient for high dimensional 
fractals. In this case also, the asymptotic efficiency 
of the test based on fractal dimensions (FDT] is 
100% relative to the t-test (note that the mean 
and variance do not exist in the case of fractal 
distributions]. 

In the event that the distributions involved are 
actually fractal distributions, MWW and FDT are 
competing tests and it may, in fact, be more reasonable 
to use the FDT for higher fractal dimensions than the 
Mann-Whitney test. 

Kruskal-Wallis-Like Test for Fractal Distributions 

The Kruskal-Wallis test assumes that the shapes 
of the distributions are identical and are scaled for 
each group, unless there are any differences in the 
medians (Gargantini & Fraser, 2011). In the case of 
random variables distributed according to a fractal 
distribution, the scales are different (but the random 
pattern persists for each scale). For this reason, we 
perform the log-transformation: 

Y.. = log(X../0), i = 1,2, ...n,, j = l,2,..g prior to 
undertaking a Kruskal-Wallis test. 
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